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Because  (inclusion)  balls  are  used,  it  is  convenient  to  call  the  method  a  ball 
N'ewton  method.  Accordingly  the  "usual'*  Mewton  method  in  the  following  will  be  called 
a  point  Newton  method. 

Tne  basic  idea  of  the  following  method  is  the  use  of  sets  (balls)  instead  of 
points.  This  idea  originates  in  interval  mathematics.  The  first  interval  Newton 
method  was  given  by  R.  E.  Moore  in  14].  It  is  easy  to  show  that  the  Moore  method  is 
globally  convergent  for  n  *  1.  This  was  shown  in  [5]  for  the  first  time.  In  the 
meantime,  many  interval  variants  of  the  Newton  method  have  been  investigated.  A 
survey  for  n  =  1  has  been  given  by  VJ.  J.  Beiser  (2].  It  seems,  however,  that  in 
the  case  n  ^  1  no  globally  convergent  interval  Newton  method  has  been  found.  There 
is  indeed  a  criterion  by  Alefeld-Herzberger  [1]  which  ensures  global  convergence  for 
Moore's  version.  Nevertheless  this  criterion  is  rather  unwieldy  and  is  not  satisfied 
in  most  practical  cases. 

For  complex  valued  functions  f  :  B  _  {t  -*•  (C,  complex  discs  can  be  used 
instead  of  complex  intervals  for  the  definition  of  a  disc  Mewton  method.  Vt\is  was 
done  by  P.  Henrici  [3]  but  no  global  convergence  could  be  proven  yet.  It  will  be 
shown  that  the  complex  problem  can  be  treated  as  a  special  case  of  the  following 
general  theory. 

In  order  to  define  the  point  Newton  method  in  general  the  following  two 
assumptions  are  made: 

exists  on  B  ,  (6) 

( ‘  * )  ^  exists  on  B  .  (7) 

In  contrast  to  (b),  no  differentiability  of  f  on  B  is  demanded  for  the  new 
mt  t'lod.  Instead  of  (C>),  it  sviffices  to  assume  the  validity  of  a  certain  inclusion 
Mpschitz  condition.  Tiis  condition  can  lx?  formulated  especially  simply  for  the 
ca.-?'-  n  -  2  and  for  B  _  I.  It  tlicn  roads 

f(x)  -  f(y)  3(x  -  v)  for  x,  y  ■  B  .  (8) 
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Herein  S  :=  iz  •  Oil's  -  z!  _  ojs'.'  is  a  comf^lox  disc  with  midcoint  .c.  :  r-  . 

a]sl.  It  is  assurned  that  0  <  o  <  1,  i.o. 

0  ^  S  . 

These  conditions  (6),  (7)  for  the  point  method  and  (B)  ,  (9)  for  the  hall  ne  tood  .ar. 

be  compared.  On  the  one  hand,  the  assumption  (3)  is  weaker  than  (6),  ix-cause  f  i:. 
(8)  does  not  even  have  to  be  differentiable.  On  the  other  hand,  (B)  and  (j)  arf. 
stronqer  than  (6),  (7)  because  the  data  S,  j  have  to  be  known  exf.'licitl-/  arid 
because  (6),  (8)  and  (9)  already  imply  (7). 

n 

The  conditions  to  be  imposed  on  f  in  IR  for  n  2  are  still  quite  weak. 
They  are  not  nearly  as  restrictive  as  the  Kantorovich  condition  (see  Ortega-rL'ieinbol  1 
[7],  p.  421).  Even  if  f  is  analytic  t’'dt  condition  is  true  in  general  only  i.n  a 
very  small  neighbourhood  of  a  zero. 

In  what  follows  the  following  problem  will  not  be  considered:  :iveii  a 
f  :  G  find  all  zeros  of  f  in  G.  If  there  is  more  than  one  zero  in  "  a.ni 

if  G  is  convex  then  there  are  in  general  {joints  in  G  where  is  nc*-  ir.vc-j^i. 

This  can  be  shown  already  in  the  case  n  =  1  by  the  mean  value  theorem.  Hence  at 
these  points  the  point  Newton  method  (1),  (2)  cannot  be  applied.  Therefore  the  Newto 
method  always  has  to  be  combined  with  some  other  method  if  all  the  zeros  of  f  on 
B  are  to  be  found.  Actually,  the  new  proposed  ball  Newton  method  also  could  be 
used  for  the  solution  of  this  more  general  problem.  It  not  only  guarantees  the 
inclusion  z  e  for  v  =  0,  1,  *  *  '  but  it  also  gives  information  rogardin? 

exclusions  of  the  kind  z  4  M,  where  M  is  an  appropriate  set.  With  interval 
Newton  methods  these  ideas  have  already  been  used  successfully  for  the  de terminal irr 
of  all  zeros  of  f  in  G,  especially  in  the  case  n  =  1.  There  will  be  another 
report  upon  the  adoption  of  the  ball  Newton  method  to  this  more  -general  problem. 

It  is  therefore  not  the  essential  purpose  of  the  following  paper  to  compute 
the  zeros  of  a  given  function  f.  Rather,  it  will  introduce  a  new  family  of  'Wv.t 
methods  and  will  investigate  their  properties. 
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A  GLOBALLY  CONVERGENT  BALL  NEWTON  .METHOD- 


Karl  L.  Nickel 


1.  Introduction 


In  what  follows  the  n-dimensional  ball  B 


used  as  the  basic  domain.  The  zeros  z  f  B  of  the  given  function  f 


wanted 


Let  the  inverse 


Let  f  be  differentiable  on  B  with  the  Jacobian  t 


exist  on  B.  Then  the  usual  Newton  me thod  can  be  defined:  The  sequence  x 


There  are  many  variations  of  (1),  (2).  In  the  simplified  Newton  method  (2)  is 


The  literature  dealing  with  the  Newton  method  is  huge.  An  introduction  is  given  b; 


L.  Rail  [8],  and  a  survey  may  be  found  in  the  book  of  Ortega-Rheinboldt  (7] 


The  methods  (1),  (2)  and  (1),  (3)  have  the  advantage  that  they  are  convergent  i; 


a  neighborhood  U(z)  of  a  zero  z  of  f.  This  means  that 


and  that  lim  x 


is  continuous  or  if 


condition,  then  the  order  of  convergence  of  (4),  (2)  is  superlinear  or  quadratic 


This  paper  was  stimulated  by  the  "Symposium  on  Analysis  and  Computation  of  Fixed 
Points”  at  the  University  of  Wisconsin-.’Iadison ,  Madison,  Wisconsin,  U.S.A.  in  "a 
of  1979.  It  was  written  while  the  author  was  visiting  the  Mathematics  Research 
Center  at  the  University  of  Wisconsin.  Address  of  the  author:  Institut  fiir 
Angewandte  Mathematik  der  Universitat,  Hermann-Herder-Str .  10,  D  7800  Freiburg  i 
West  Germany. 
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The  disadvantages  of  (1),  (2)  and  (1),  (3)  are  well-kno‘vn: 

1)  From  (1)  it  may  follow  that 

X  -i  B  (5) 

V 

for  some  ■  K.  In  this  case  the  method  cannot  be  applied  further. 

2)  Even  if  x  c  B  is  true  for  all  v  =  0,  1,  *  *  •  the  sequence  {x  }  is  not 
necessarily  convergent. 

3)  Without  additional  assumptions  on  f  there  are  no  easily  determined  a  priori 
or  a  posteriori  error  bounds. 

In  order  to  avoid  the  first  two  disadvantages,  modified  Newton  methods  are 
sometimes  considered  instead  of  (1),  (2).  For  example  the  new  rule 

x  ,  :=  X  -  Gt  (f)  ^  (x  )  f  (x  )  for  v  =  0,  1,  *  *  * 

is  often  considered  (see  Ortega- FJieinbold  [7)).  Herein  is  ^  sequence  of 

appropriate  real  numbers  often  defined  by  minimization  methods.  If  the  are 

chosen  suitably  then  (5)  can  be  avoided  and  convergence  for  all  c  B  can  be 

enforced.  This  is  called  global  convergence.  Theoretically  these  results  are  very 
satisfying.  Practically,  their  use  is  often  quite  awkward.  Because  of  this  and  in 
order  to  remove  the  above  third  disadvantage,  in  what  follows  a  new  approach  is 
pre  sented. 

The  main  idea  of  the  new  method  is  to  construct  a  whole  ball  2  ,  with  the 

-  v+1 

property  z  ■'  instead  of  determining  a  new  point  each  step  as  in  (2) 

or  (3).  Hence  in  each  step  a  new  a  priori  error  bound  is  found.  Ttie  initial  ball 
is  2^  3.  If  z  Z  B  is  chosen,  then  case  (S)  can  never  occur.  It  will  be 

shown  that  the  radii  of  the  balls  converge  to  zero  at  least  like  a  geometrical 

secjcnce.  3inco  z  Z  arid  z  Z  ,  z  -*•  z,  hence  global  convergence  occurs. 

;  V 

n  V  r  a;  :  ro:  riate  assum:  tions  on  f,  even  superlinear  or  quadratic  convergence  can 
%hown.  If  there  is  no  zero  z  of  f  in  B  then  tJic  method  stops  after  a  finite 
r  j-iy  r  of  str-ps.  T^iis  is  an  additional  advantage  of  the  new  mc'thod. 
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si":jifica\’CE  and  explanation 


On>'  of  the  main  problems  in  Numerical  Analysis  is  the  computation  of  the 
zeros  of  i  n-dimensional  function  f  :  .  For  centuries,  the  so  called 

^iewton  method  lias  been  used  and  inv'estigated.  This  meth.od  has  some  very 
important  advantages . 

i)  If  it  converges  then  its  rate  of  convergence  is  usually  quadratic, 

ii)  It  is  quite  easy  to  describe,  to  understand  and  to  program. 

There  arc,  however,  very  severe  restrictions ,  drawbacks  and  disadvantages 
to  this  method; 

i)  In  its  pure  form  it  is  not  a  method  at  all  nor  does  it  provide  an 

algorithm  since  in  general  one  cannot  predict  a  priori  if  it  will  work, 
ii)  It  provides  ’'approximations"  but  no  error  bounds,  i.e.  these  "approximations 
may  have  no  significance  at  all. 

iii)  Even  if  it  works  and  converges  (which  contrary  to  popular  belief  does  not 
occur  too  often)  it  cannot  give  any  information  wliether  a  zero  lies  in  a 
given  domain  or  if  that  domain  is  free  of  zeros. 

The  method  described  in  this  paper  avoids  all  the  above  disadvantages  with¬ 
out  losing  the  advantages  of  the  usual  Newton  method.  It  is  based  on  the 
fiuidamental  idea  of  interval  mathematics:  Instead  of  compjuting  "approximation 
points "  to  t'ne  solution  of  a  mathematical  i  roblem  without  known  error  bounds, 
one  s'--ould  try  to  evaluate  "approximation  sets "  whidi  are  guaranteed  to  contain 
t:.c  solution.  In  tlio  method  used  here,  one  of  the  most  simple  such  sets  is 
used;  namely  ijrolls.  .Ml  t!ie  disadvantages  of  the  "point"  Newton  method  disappear 
t.:  -.'j'.’n  a.s  on'-'  .~witc;ies  to  a  "ball"  ;jev.'ton  method.  Astonishingly  the  assumptioiis 

•-  : mu-l-  for  furi'T.  f  are  more  complicated.  In  a  certain  sense 

•-  -r  •  ■-'•n  "  im;  b  r"  a;,d  ".'orr'  "natural."  Instead  of  differentiability  of  f 
;  in_cl  ::uy_e,  Mpw.itz  condition  is  required. 


■  !•  •••'ording  and  views  expressed  in  this  descriptive 

in'!  :i'>t  -wit!,  t'--:c  author  of  this  report. 
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2 .  Notations  and  definitions;  balls  and  regular  ball  operators. 

2. 1.  Let  n  c  r;  be  the  fixed  dimension  number.  Let  an  arbitrary  norm  •  .oe 

chosen  on  As  matrix  norm  for  n  »  n-matrices  the  operator  norm  cor  re  soon  dir.  j 

to  il  •  II  will  be  used.  Only  real  values  will  be  considered  for  simplicity. 

2 

However,  in  examples  the  space  (C  will  be  used  also,  as  C  =  P  . 

Notations : 

1)  Lower  case  Greek  letters  (a,  S,  ‘  )  alwavs  mean  real 

V 

numbers . 

2)  Lower  case  Roman  letters  (a,  b,  '  '  •  ,  z,  z^,  ■  •  •  )  are  n-dimensional 
vectors  or  vector  valued  functions.  (Exception:  n  always  means  the  integer 
dimension  number.  ) 

3)  Capital  Greek  letters  ((ji,  E,  A,  •  •  •  )  denote  n  y  n-matrices  or  matrix 
functions.  As  usual  Ax,  EA  means  matrix  multiplication. 

4)  Capital  Roman  letters  (A,  B,  •  •  •  ,  Z,  ,  •  •  ■  )  are  n-dimensional  sets . 
in  particular  balls  or  ball  valued  functions.  (Exception:  I  denotes  the 
identity  matrix. ) 

5)  The  function  family  considered  is  denoted  by  F. 

Remark :  In  contrast  to  functional  analysis,  which  uses  different  alphabets  to 

distinguish  between  elements  and  operators,  here  different  alphabets  will  identify 
points  and  operators  which  have  values  in  different  sets. 

2.2.  Definition  (ball):  Let  z  e  e"  ,  0  <  ^  e  E.  The  set 

Z  :=  {x  e  e"||Iz  -  xII  <  ;}  (K) 

is  called  a  (n-dimensional  real)  ball  with  the  midpoint  (center)  z  and  the  radius 

Z- 
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V.otations  to  ilO^: 


Z  =  <  z,  :  )  , 

z  =  mid  2  , 

•  =  rad  Z  , 


X  ■  Z  :  <=»  II X  -  z  II  <  c. 


(11) 


The  midpoint  of  a  ball  which  is  characterized  by  a  capital  Roman  letter  will  be 
called  by  the  sanr- -  ijuf  lower  case  -  Roman  letter.  For  the  radius,  however,  the 
corresponding  small  Greek  letter  is  used.  If  G  ^  IR^  then  the  set  of  all  such  balls 
Z  G  will  be  denoted  by  K(G).  Clearly  f"  £K(IR^)  . 

Properties : 

1)  .a.ny  ball  Z  £  K  (IR*^)  is  compact  and  convex. 

2)  The  inclusion  0  £  Z  =  (z,  is  true  if  and  only  if  ^  >  II  z  II. 

Definition  (K  (IP.''‘))  :  The  set  of  all  balls  Z  £  KCIr”)  for  which  0  «  Z  holds  is 
-  no  ^ 

denoted  bv  K  (IR^). 

no 

Properties : 

1)  Let  be  Z  ■  K  (rt”)  and  (differing  from  the  notation  in  (ID)  use  the  characteriza¬ 
tion 


Z  =  (  z,  ;  l|z  ID 

then  the  following  holds:  Z  r  K  iff  z  ^  0  and  0  £  c  <  1. 

no  = 

2)  If  L  =  <  ,  ■  II .  II)  ■■  K  (  p”),  then  the  set  of  all  balls  <  u"  ,  '  llpzll)  ,  where 

no 

F,  lies  in  a  (double)  cone  with  the  vertex  at  zero  and  with  the  defining 

a.Tjle  t  arc  sin  •.  See  Figure  1  for  n  =  2. 

Definition  (ball  arithmetic):  Let  a  ^  p,  a  p",  Z  =  <z,  Dr  K(p").  Define 

.  •  Z  =  Z  •  ,  :=  <  ^z,  'a ; : >  ,  (12) 

a  +  Z-  Z  +  a:=(a  +  z,').  (13) 


urt..c.r  operations  ('’.g.  X  +  an  not  needed  in  what  follow?  and 

Ir  algo;, raid  structures  are  tlieruforo  not  investigated  here. 


Euclidean  metric.  The  set  of  balls  <uJ,i|iu--ll  ^  fot 
is  contained  in  a  (double)  cone  with  the  defining  angle 
arc  sin  X. 


0  the.n  (12)  and  (13)  are  the  usual  definitions  on  IR 


)  The  following  inclusions  are  true 


De finition  « .X  Y)):  Let  X,  Y  e  ]K(1R‘  )  and  let  X  n  Y  ^  )3.  Then  there  is  (at 


least)  one  ball  2  K(3R  )  for  which  X  n  Y  c  z  and  with  rad  Z 


min 


there  is  more  than  one,  an 


See  Zigure  2  for  n  =  2  and  for  the  Euclidean  metric 


'oilowing  inclusions  and  inequalities  are  true 


X  and  Y  is  shaded.  Z  =  <x  n  Y>  is  the  smallest  ball  containing 
X  n  Y. 


-•3.  Definition  (regular  ball  operator):  Let  A  be  a  regular  n  x  n-matrix,  and 
•  e  R  with  0  <  A  <  1.  The  regular  ball  operator  L  is  defined  by  the  ball 
valjed  operator 

L  :  ]r"  -*•  K  (m")  u  {0}  , 

no 

Lx  :=  <  Ax,  XII  Ax  ID  for  x  €  ]r" 

Notations :  The  operator  L  will  also  be  denoted  by 

L  =  <(A,  X>)  . 

Remarks : 

1)  In  what  follows,  more  general  ball  valued  operators  S  :  r"  ->'K(Ir")  are  not 
needed. 

2)  Any  ball  operator  can  also  be  considered  as  an  operator  ball  according  to 

L  =  (<  A,  X  >> 

n  (■ 

:=  {i|>  n  X  n-matrix  |ll  (i|)  -  A)xll<  XllAxll  for  all  x  eR 

i)  This  view  and  formula  (15)  are  equivalent  to 

’ix  c  Lx  :=  <  Ax,  XII  Ax  ID  for  all  x  e  r"  .  (16) 


4)  The  inclusion  (16)  will  be  abbreviated  to 


Properties ;  For  any  regular  ball  operator  L  the  following  is  true  ; 

1)  0  I  Lx  for  X  ^  0  . 

2)  Ip  €  L  =((  A ,  X>>  if  and  only  i  f 

-  I  II  =  II  (*  -  II  i  >  • 

This  means  that  the  relative  error  between  ip  and  A  is  bounded  by  /  ^  1 

3)  If  e  L  =  <<A,  X>>  then  'p  is  a  regular  matrix  and  for  the  inverse 
following  inequality  holds  with  cond  A  :=  II  Ml  HA  ^11  ; 

iii^'^A  -  I II  =  II  -  a"^)a  II  <  j— ry  ■  • 

This  means  again  that  the  relative  error  between  ^  and  A  ^  is  bounded 
addition 

^  1  +  cond  A 

holds  then  the  right  hand  side  of  (17)  is  smaller  than  one,  i.e. 

,-l  .  ,,.-1  X 

*  "  H  == 

and  is  a  regular  ball  operator. 


cond  A  ) ) 


3.  T:\e  class  of  functions 

3.1  Definition  (class  F);  Let  the  class  F  of  functions  be  the  set  of  all 

functions  f  ;  B  •  which  satisfy  the  following  property:  For  each  set  C 
of  the  form 

C  =  <a,  .)  B  (19) 

with  a  B,  there  exist  a  regular  n  <  n-matrix  A  =  A(C)  and  a  real  number 
^  (C)  such  that 

0  <  ■  <  1  (20) 

and  for  all  x,  y  C, 

llx  -  y  -  *.{f(x)  -  f{v))!t  <  X!!A(f(x)  -  f(y))li  .  (21) 

If  there  is  more  than  one  ■  (C)  and/or  .‘.(C),  an  arbitrary  one  is  selected  and 
attached  to  F. 

Remarks ; 

1)  By  using  the  regular  ball  operator 

L  :=  <<  A,  >  >>  (22) 

the  inequalities  (20),  (21)  can  also  be  written  as 

x-y  L(f(x)  -f(y))  for  x,  y  •  C  .  (2  3) 

2)  If  f  F  then  the  inverse  f  ^  of  f  exists  on  the  set  f(B)  :=  '  f  (x)  |x  e  B}. 
Hence  by  defining  u  :=  f(x),  v  f(y)  the  inclusion  (2  3)  can  also  be  written  as 

(u)  -  (v)  •  L(u  -  V)  .  (24) 

From  (34)  the  class  F  of  functions  can  also  be  characterized  by  the 
Equivalent  Definition  (F):  The  class  F  consists  of  the  set  of  all  functions 

f  :  B  ^  IP.^  for  which  f  ^  exists  on  f(B)  and  which  satisfy  the  inclusion 
Lipschitz  condition  (24)  on  C  defined  by  (19). 

3.2.  With  these  results  and  with  those  of  52,  the  functions  f  r  F  have  the  following 
Properties ; 

1)  'n'.ore  is  a*-,  most  one  zero  z  of  f  F  in  B. 

On  C  defined  by  (K^),  any  fiinction  f  F  satisfies  the  Lipschitz  condition 
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(1  + 


f  (x)  -  f  (V)  <  llx  -  yll  . 


Honce,  f  is  continuoas  on  B.  There  are  similar  Lipschitz  conditions  for  f 

3)  Let  f  F  be  Frechet  differentiable  at  the  point  x  B  and  let  (x)  be 

the  Jacobian.  Then  ^  (x)  exists  and  is  bounded  by 

<  II  (*•  )'^  (x)ll  <  (1  +  .X)IIMI  . 

Il/,■•^ll  ' 

Moreover,  the  relative  error  between  A  and  (<>')"^  can  be  bounded  by 
ll((r)"^(x)  -  A)A”’'II  =  II  (r  )’^  (x)A*^  -  III  <  X  . 

4)  Let  f  ■-  F  be  Frechet  differentiable  on  the  entire  domain  C  defined  by  (19). 


rhen  for  all  x,  y  c  C 


li’  (X)  -  (y)li  <  IIA'^II 

~  i  “  ^ 


5)  The  result  of  4)  implies;  Assume  that  for  all  sets  C  :=  A  n  B  where 
a  =  mid  A  £  B  the  following  is  true: 


(OH  <  u  , 


.\(C)  <  Xq  -  1  , 


for  aopropriate  real  values  u .  X  <  B.  If 

0 

X  (C)  -*  0  for  rad  A  -►  0  ,  (26 

the..  is  continuous  on  B  and  if  there  exists  a  c  B  such  that 

>  (C)  <  n  •  rad  A  ,  (27 

then  is  Lipschitz  continuous  on  B. 

Criteria  for  f  F, 

Assame  that  there  exists  a  matrix  1  =  'Mx,  y)  (=  C,(y,  x) )  for  all  x,  y  c  C 


defined  b"  (19)  such  that 


f  (x)  -  f  (y )  =  ;.  (x  -  y ) 


3 


This  is  certainly  true  if  f  satisfies  one  of  the  mean  value  theorems  on  C  (cc- 
Ortega-Rheinboldt  [7)).  In  general  there  are  infinitely  many  such  matrices.  It  A'iH 
be  shovm  in  §4  how  to  construct  such  matrices  Ji  which  are  continuous. 

3. 3.1.  Let  there  exist  ij;  ^  =  iji  ^  (x,  y )  on  C  and  let  the  inclusion 

Ip  ^(x,  y)  e  L  (29) 

be  satisfied  for  all  x,  y  e  C  with  the  regular  ball  operator  L  =  <(.'.,  ■  >  >.  Thien 
(21)  is  true,  i.e.  f  e  F. 

3.3.2.  Let  there  exist  an  n  x  n-matrix  A  and  a  real  constant  0  <  >  <  1  such  that 
for  all  X,  y  e  C 

III  -  Aifill  <  X/(l  +  X)  .  (30) 

Then  A  and  \Ji  (x,  y)  do  exist  on  C,  the  operator  L  :=  ((A,  >  >>  is  regular 
and  i|/  ^  e  L.  Hence  f  e  F  by  3.3.1. 

3. 3. 3.  Let  f  be  continuously  Frechet  differentiable  on  C  with  the  Jacobian 
4)'  =  <>'  (x,  y).  For  X,  y  £  C  define: 

1 

4:(X,  y)  :=  /  41'  (x  +  t(y  -  x))dt  .  (31) 

0 

From  the  Mean  Value  Theorem  and  because  C  :=  A  n  B  is  convex,  the  following  two 
additional  sufficient  conditions  arise: 

3.  3.  3.1.  Let  there  be  a  regular  n  x  n-matrix  A  and  a  real  number 
0  <  a  <1/(1+  cond  A)  such  that  for  all  x  e  C: 

41'  (x)  £  L^  :=  <<  A”^,  o  >>  . 

Then  also  ip(x,  y )  £  L^  for  all  x,  y  £  C,  furthermore  V  ^  (x,  y)  exists 

and  with  the  definition  X  :=  ^ - —  cond  A,  the  inclusion  0  ^  ■  L  :=  <(  ’  , 

holds  by  (18).  Then,  f  e  F  by  (29). 


(32) 

on  2 , 

>> 
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3.  3. 3.2.  Let  there  be  an  n  ^  n-matrix  and  a  real  number  0 


for  all  X  e  C: 


ll  -  A({i'  (x)ll  <  1/(1  +  X)  . 


Then  (30)  holds,  so  A  ^  and  'p  ^  (x,  y)  exist  for  x,  y  f  C  and  f  F  holds. 

3.4.  Examples :  Itie  case  n  =  1  and  the  complex  case. 

The  two  spaces  F  and  C  can  l)e  treated  together.  As  usual  C  is  imbedded  x 

2 

IR  ;  furthermore  on  both  F  and  C  the  Euclideein  norm  is  used.  In  what  follows, 
the  case  r.  =  1  is  shown;  the  notations  for  C  are  then  added  in  square  brackets: 

The  ball  Z  =  <z,  c)  is  an  interval  [a  disc].  If  0  ^  Z  then  Z  ^  exists  an 
is  an  interval  [a  disc]  which  can  be  written  as 

_-l  |z|^  ,  -1  .1  1-2  , 

Z  =  — «  -IH  >  • 

\z\  -  t 

If  X  £  Z  then  also  x  ^  e  Z 

The  application  of  a  regular  ball  operator  S  =  <<1,  o>>  to  an  element  x  ca.-. 

be  interpreted  as  real  [complex]  multiplication  of  the  real  interval  <1,  r>  = 

=  [1-0,  l+o]  by  0^S€Fl£(D)  and  x  f  K[c  I]  according  to 

Sx=<Sx,  a|sx|>=I*<l,o>>x  .  (33) 

For  functions  f  :  B  ->-F[C]  and  C  defined  by  (19)  one  has  the 
Criterion :  Qi  F[C]  the  property  f  c  F  is  true  if  and  only  if  f  satisfies  on 
any  set  C  the  inclusion  Lipschitz  condition 

f(x)  -  f(y)  e  S(x  -  y)  for  x,  y  t  C  .  (8) 

Herein  S  =  £  •  (1,  o  >  is  defined  according  to  (33)  with  0  ^  £  c  F[  I]  and 

0  <  a  <  1.  The  ball  operator  L  =  <<A,  X>>  in  (21)  and  (23)  is  defined  by 

A  :=  £‘V(1  -  0^)  ,  X  :=  a  .  (34) 

If  f  is  differentiable  on  C  [hence  holomorphic]  then  (8)  is  satisfied  for 

(^'(z)  c  Z  •  o>  for  z  e  C  ,  (35) 


The 
circles ) 


proofs  are  elementary  fon  (T  for  example  by  using  the  inversion  of  complex 
and  will  be  omitted. 
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ball  Newton  otH~>rator  11. 

Definition  (ball  tiewton  operator):  Let  f  ■  F,  A  =  <a,  a  >  c  KdR^)  with  a  •  B. 
L  =  <(  ’,  ■'  )  >  be  the  regular  ball  operator  corresponding  to  C  :=  A  n  B 
according  to  (22).  Define  the  ball  Newton  operator  N  by 

N  :  C  >•  lK(Iv")  ,  1 

I 

Nx  :=  X  -  Lf (x)  I 

=  <x  -  Af  (x)  ,  XIIAf  (x)ll  >  for  X  .  C  .  j 

Obviously  N  =  N(C),  this  dependence  will  be  observed  and  used  later  occasional 
Properties : 

1)  From  (13)  and  by  (11)  it  follows  that 

mid  Nx  =  X  -  Af (x) 


Let 


(36) 


ly. 


and  (37) 

rad  Nx  =  XII  Af  (x)  II 

Hence  both  mid  Nx  and  rad  Nx  are  continuous  on  B. 

2 )  If  z  £  C  then : 


f(z)  =  0  if  and  only  if  z  =  N(z) 

Hence  the  zeros  of  f  are  on  C  exactly  the  fixed  points  of  the  operator  N. 
3)  Test  N  (Non-existence):  If  x  •  C  and 

C  n  Nx  =  0 

t?«n  there  exists  no  zero  of  f  on  C. 


4) 

Test  E  (Existence) 

:  Let 

f  £  F 

and  assume  that  f  is  Frechet  differentiable 

on 

C.  Let  there  be  one 

point 

X  c  C 

such  that 

Nx  ^  C  .  (38) 

Then,  f  has  (exactly)  one  zero  z  c  C. 

5)  If  X,  z  C  and  f(z)  =  0  then 

z  e  Nx  .  (39) 

This  means:  By  applying  the  Newton  operator  N  to  an  arbitral,'  point  x  e  C  one 
gets  an  error  bound  for  the  zero  z  of  f  in  C.  In  other  words:  By  applying  the 
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Newton  operator  N  no  zero  z  of  f  can  "get  lost."  However,  x  (  '.'.x  if 
X  /  z. 

6)  Let  there  exist  a  sequence  {x  }  with  x  f  C  for  u  K  amd  with  lim  x 

V  V 

where  f(z)  =  0.  Then  the  following  is  true: 


z  £  Nx 


for  u  t  U 


lim  mid  Nx 

V 


and 


(4  ) 


lim  rad  Nx 
^-►00 


(41) 


This  means:  The  application  of  the  Newton  operator  N  to  a  convergent  sequence 
(x^l  gives  a  sequence  of  error  bounds  tNx^}  which  converges  also  in  the  sense  of 
(40)  and  (41). 

The  proofs  of  the  properties  1)  to  3)  and  5),  6)  will  be  omitted.  Only  the 
proof  of  Itest  E  will  be  given.  The  basic  idea  of  this  proof  has  already  been  used 


in  an  earlier  paper  [6].  Define  x  =  (C, 


’n>'  y  =  '"l' 


), 


f  -  (v-^,  •  •  •  - 


;  :*  /3C  •  Let  the  matrix  *  0(x,  y)  have  the 

MV  M  V  r  r  »  / 


components  =  .^^^(5^,  Cj,  •  •  •  ,  5^,  n^, 


n  )  for  V  =  1  (l)n. 
n 


Let  the  if/  be  defined  by  the  following  divided  differences 


-  % 


for  i  ^ 


yv  1  v-l  v  v+l  n  V  V 

The  identity  (28)  is  satisfied  by  the  construction  of  If  one  of  the  arguments 

X  or  y  is  fixed,  then  :Kx,  y)  is  continuous  with  respect  to  the  other  argument 

y  or  X  on  C.  Because  of  f  c  F,  the  inverse  ip  ^  (x,  y)  exists  for  all 

X,  y  £  C.  Furthermore,  p  ^  (x,  y)  is  also  partially  continuous  with  respect  to 
xor  y  on  C.  Let  X£C  be  fixed.  Define 
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g(y)  :=  y  -  v‘^(x,  y)f(y)  .  (-52) 

The  function  g  is  continuous  for  y  C.  Furthermore,  by  (28)  the  identity 

g  (y )  =  X  -  g;  ^  (x,  y  )f  (x) 

holds.  By  using  (36),  (29)  and  (38)  one  then  gets 

g(y)  £  X  -  Lf  (x)  =  IJx  £  C 

Hence  the  continuous  function  g  maps  the  convex  and  compact  set  C  into  itself. 

By  the  Schauder  fixed  point  theorem,  there  is  therefore  at  least  one  fixed  point 
y  =  g(y)  e  C.  But  because  of  (42)  this  fixed  point  y  of  g  is  also  a  zero  of  the 
function  f.  ■ 
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3 .  The_sunj2l_^£ied_bal_lj  Newton_al20£it)™__SM . 

S  ■  1  ■  Sujt'ose  f  F. 

Problem: 

i)  Is  there  a  zero  z  of  f  on  B? 
ii)  If  yes,  compute  z  in  a  constructive  way. 

How  to  proceed:  A  simplified  ball  Newton  algorithm  SNA  will  be  presented.  Starting 
with  :=  B  successive  balls  Z^,  Z^,  ‘  ’  •  will  be  constructed  with  the  midpoints 

z  :=  mid  Z  t  B  for  v  =  0,  1,  •  •  •  . 

Ei the r  the  algorithm  stops  after  a  finite  number  of  steps.  This  is  true  if  and  only 
if  f  does  not  have  a  zero  z  €  B. 

Or  the  construction  can  be  continuated  indefinitely.  This  happens  if  and  only  if 
there  is  a  (uniquely  determined)  zero  z  of  f  on  B.  Then  the  midpoints  z^  con¬ 
verge  to  that  zero  z.  Furthermore,  at  each  step  the  error  inclusion  z  e  Z  holds 

V 

and  the  radii  converge  to  zero. 

During  this  simplified  algorithm  SNA  the  fixed  initial  parameters  A(B),  1(B) 
are  always  used,  notwithstanding  the  possibility  that  later  on  tetter  data  A(Z^), 

■ (Z^)  could  be  available.  Hence  SNA  corresponds  to  the  simplified  point  Newton 
method  (1),  (3).  Li)ce  the  latter  it  is  only  linearly  convergent.  A  more  general 
Newton  algorithm  NA  with  updating  in  each  step  will  later  be  presented  in  §6.  It 
corresponds  to  the  general  point  Newton  method  (1),  (2)  and,  therefore,  is  super- 
linearly  or  even  quadratically  convergent. 

5.2.  Preliminary  remar)t.  Let  A  =  <  a,  «  >  be  a  ball  and  relative  to  the  midpoint 
a  define  the  new  ball  Na  :=  a  -  Lf(a)  with  the  Newton  operator  N.  Then,  depending 
on  the  value  of  llf(a)ll,  different  pjossibilities  arise  as  s)cetched  in  Figure  3  for 
n  =  2  and  the  Euclidean  norm.  As  long  as  II  f  (a)ll  is  very  small,  the  ball  Na  is 
completely  contained  in  A.  For  growing  values  of  llf(a)ll  the  radius  of  Na  expands. 
Tile  i.'itersection  /,  :Ja,  which  is  shaded  in  Figure  3,  is  first  growing  but  shrinlcs 
later  and  finally  disappears  completely.  Let  5  loe  the  diameter  of  the  set  A  n  Na. 
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The  maximum  is  reached  in  figure  3c).  By  constructing  the  smallest  ball  A  :: 
which  contains  the  set  A  n  Na  (this  ball  is  dashed  in  Figures  3c)  to  3e  )  )  ,  one 
therefore  gets  for  u  ^  0 

■5/2  =  rad  (A  n  Na  )  <  ‘.n  •'  i 


Kindly  note  that  the  midpoint  of  the  ball  (A  n  Na  )  always  lies  in  A  (and  in  Na 
too).  This  follows  from  (14). 

5 . 3.  Definition  of  the  algorithm  SNA. 

Since  f  e  F,  there  are  constemts  MB),  A  (B)  satisfying  (20),  (21).  With 
these  constants  the  fixed  Newton  operator  N  is  defined  by  (36).  The  algorithm  S:;A 
is  defined  recursively.  Since  the  initial  ball  :=  B  has  an  exceptional  position, 
the  determination  of  is  different  from  that  of  Z^ ,  2^,  •  •  •  . 


Initial 

step 

:  Define 

"o 

:=  B  = 

'  furthermore  the  ball  Nz^ 

1. 

If 

n  Nz 

0  0 

=  0 

then 

the  algorithm  is  stopped. 

2, 

If 

n  Nz 

0  0 

r‘  0 

then 

define 

Z,  :=  <  Z„  n  Nz„  >  . 

10  0 


(44) 


Continuation  step:  Assume  that  Z^  for  v  >  1  is  already  defined  with  tl,e 

property  z  =  mid  Z  e  Z„.  The  ball  Nz  is  derived  from  z  . 

V  V  0  u  V 

1.  If  Z^  n  Nz_^  =  0  then  the  algorithm  is  stopped. 

2.  If  Z  n  Nz  ^0  then  a  new  preliminary  ball  Z*  ,  is  constructed  by 

V  V  ‘  \)  +  l 

z;^l  :=  <Z^  n  NZ.^>  .  (45) 


2.1. 

If  Z*^^  0  Z^  =  0 

then  the 

algorithm  is 

Stopped 

2.2. 

Let  Z*  ,  n  Z„  0 

and  let 

z*  ,  :=  mid 

Z*  . 

v+1  0 

v+1 

v+1 

2.2.1.  If  z*  ,  € 
v+1 

Zg  then 

define 

(4’;  ' 


(4-) 
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5 .4.  Theorem  1 ;  Suppose  f  e  F.  TTie  function  f  has  no  zero  in  B  if  and  onl/ 
the  algorithm  SNA  stops.  The  function  f  has  a  (uniquely  determined)  zero 
z  e  B  if  and  only  if  the  algorithm  SNA  can  be  continued  indefinitely.  Iri  cr.is 
case  it  yields  a  sequence  balls  fot  which 

lim  z  =  z  €  B 

V 

v-x* 

and  (‘59) 

z  c  Z  for  y  =  0 ,  1 ,  •  •  • 

u 

The  sequence  of  the  ball  radii  converges  at  least  linearly  to  zero  according  to 

lim  =  0  , 

C  .  <  Xz  for  V  =  0,  1,  •  •  •  .  (■59) 

y+1  =  u 

Proof : 

1)  If  X  =  0  then  f  is  a  linear  function  by  (21).  In  this  case  the  application 
of  the  loan  Newton  operator  N  to  z^  yields  the  point  z.  If  z  ^  B  then  the 
algorithm  stops  at  the  initial  step.  If  z  e  B,  then  the  algoritlun  never  stops  and 
gives  the  result  =  <z,  0>  for  all  v  =  1,  2,  •  •  •  .  In  what  follows,  therefore 
X  >  0  can  be  assumed  with  loss  of  generality. 

2)  Assume  that  there  is  a  (uniquely  determined)  zero  z  of  f  on  Z^  :=  B.  Then 
z  £  NZjj  by  (39).  Therefore,  z  £  Zj^  also,  where  Z^^  is  defined  by  (44).  Hence, 
the  algorithm  is  not  stopped  during  the  initial  step. 

Assume  that  it  is  already  proved  that  z  £  Z^.  By  construction,  z^  Z^. 

Therefore,  z  £  Nz  by  (39)  eind  this  inrolies  z  e  Z*  ,  ,  where  Z*  ,  is  defined  C': 

V  v+l  .'+1 

(45).  Hence,  neither  Z  n  Nz  =0  nor  Z*  ,  n  Z  =0  and  therefore  the  algorithm  ( 
V  u  v+l  0 

not  stopped  at  the  u-th  step  either. 

This  is  true  for  all  v  =  0,  1,  •  '  •  and  means:  If  z  e  B,  then  the  ball 
Newton  algorithm  SNA  does  not  stop. 

3)  Conversely,  assume  now  that  the  algorithm  SNA  does  not  stop.  It  is  to  kc  shown 

that  the  sequence  °f  the  Isall  mit^oints  converges  to  a  limit  point  z  B. 
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Initial  step:  From  the  definition  (44),  one  deduces  because  of  c  and  tjecause 

of  (4  3)  the  two  inequalities 


!lz  -  z  <  : 

1  0  =  0' 


1  =  0  . 


(50) 

(51) 


Continuation  step:  &/  inserting  (4  3)  in  the  definition  (45)  and  observing  (14)  one 
gets 

(52) 


!lz*  ,  -  z  II  <  ^ 

V  +  x  -j  ~  V 


<  H  .  (53) 

v+1  =  V 

In  the  case  2.2.1,  because  of  (46)  this  gives 

Vl  i  '’V  • 

Ln  t.ie  case  2.2.2.,  one  deduces  from  the  definition  (47)  by  observing  (14)  tlie 
inequalities 


Iz  ,  -  z*  II  <  ^*  , 

v+1  u+1  =  v+1 


(56) 

^+1  S  '‘v+1  ■ 

The  inequalities  (52),  (56)  and  (53)  together  with  the  triangle  inequality  therefore 


yield 


II  z  ,  -  z  II  <  r,  +  t*  , 

v+l  V  =  V  u+1 


<  I  (1  +  X) 
=  u 


(58) 


The  inequality  (57)  together  with  (53)  lead  to  the  previous  inequali ty  (55 ) . 

These  proven  inequalities  (51)  for  v  =  0  and  (55)  for  v  >  1  are  exactly  the 
inequality  (49)  claimed  in  Theorem  1.  By  using  induction  one  gets  from  this  that 


=  '0 


for  V  =  0,  1',  •  •  ■ 


(59) 


5.'  combining  the  inequalities  (50),  (54)  and  (58)  and  by  observing  (59),  one  gets  for 
all  =■  0 ,  1 ,  ■  •  •  tix;  bounds 
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a)  the  condition  (26)  or 

b )  the  condi  tion  (27). 

Then,  all  the  statements  of  Theorem  1  also  remain  true  for  the  algorithm  ;JA.  Ir. 
addition  to  (49)  the  following  holds:  The  convergence  of  the  ball  radii  '  is 
either 


a)  superlinear  or 

b)  quadratic. 

The  proof  follows  iitmediately  from  the  definitions  (25)  to  (27). 
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7. 


Examples . 

The  following  two  examples  have  been  chosen  deliberately  to  be  simple.  The 
2 

space  C  =  li  is  used.  In  both  cases,  the  function  f  :  I  --  I  is  a  quadratic 
polynomial,  and  its  zeros  are  therefore  known.  Furthermore  the  derivative  :  '  of 
f  is  linear.  If  C  e  Kdl)  is  a  disc  then  the  set  (C)  :=  ■  (z)  z  ■  C  is  also 

a  disc.  Hence  the  condition  (35)  can  be  verified  extremely  easily. 

7.1.  Non-existence  example. 

Let  f(z)  :=  (z  -  3  +  2i)  (z  -  3  -  2i)  =  z^  -  6z  +  13  and  B  :=  <0,  2). 

Clearly  (B)  =  (-6,  4  >.  By  (34),  (35)  one  gets  therefore  f  c  F  with  (B)  =  -3/1 

X(B)  =  2/3. 

By  using  the  simplified  ball  Newton  algorithm  SNA  one  gets  (see  Figure  4): 

=  <0,  2>,  NZq  =  (3.9,  2.6  >,  =  (  83/52,  /3927/52  >,  NZj^  =  (  18319/5408,  3229/2704  ) 

£<3.4,  1.3).  Hence  n  Nz,  =  0  and  the  algoritlim  SNA  is  stopped.  By  the  non 

existence  test  N  and  Theorem  1  there  is  therefore  no  zero  z  of  f  in  Z^^  =  B. 

7.2.  Existence  example. 

Let  f(z)  :=  (z  -  1 )  (z  -  4)  =  z^  -  5z  +  4  and  B  :=  (0,  2  >.  Clearly 
(B)  =  (-5,  4  >.  By  (34),  (35)  one  therefore  gets  f  £  F  with  A(B)  =  -5/9  a.nd 

X  (B)  =  4/5.  The  discs  Z^  :=  B,  Z^,  •  ■  •  ,  Z^,  Nz^,  ■  •  •  ,  Nz^  of  the  algorithm 

SNA  are  sketched  in  Figures  5  emd  6.  One  sees  immediately  that  Nz^^  £  Z^.  Hence  by 
the  existence  test  E  there  exists  (exactly)  one  zero  z  £  Nz^^  £  =  B.  The  speed  of 

convergence  of  the  simplified  algorithm  SNA  is  very  low.  See  the  values  in  table  1 
where  the  discs  Z^  to  Z^  are  given.  After  4  iterations  not  even  the  first  digit 
of  z  is  ensured. 

The  circles  (f'  (Z^)  are  sketched  in  Figure  7  for  v  =  0,  1,  2.  One  sees  that 

0  £  <li'  (Zj^)  and  that  (J>'  (Z^)  n  <(i'  (Z^^)  =  O'  (Z^).  Hence,  the  inprcived  algorithm  N.X 

cannot  give  better  values  than  the  algorithm  SNA  before  v  =  2.  Starting  with  the 
index  v  =  2,  the  results  are  actually  much  better  due  to  the  inclusion 
O'  (Z^)  c  o'  (Zq).  This  can  lie  seen  from  the  values  in  table  1  for  the  discs  to 


-25- 


r'rom  the  values  of  the  radii 


3'  ’4 


one  can  already  anticipate  quadratic  ccvi- 


verqence.  After  4  iterations  the  quadratically  convergent  algorithm  NA  already 
guarantees  nearly  5  digits  of  z.  After  one  more  iteration  (which  is  not  included  in 
table  1)  even  10  digits  can  be  guaranteed. 

Table  1 .  Comparison  of  the  discs  to  Z  created  by  the  algorit.hms  SNA  and  NA 

for  the  example  7.2. 

Simplified  ball  Newton  algo-  m  Ball  Newton  algorithm  NA, 

I  rithm  SNA,  linearly  convergent  |l  quadratically  convergent 


^0 

1 

1  ( 0 

1 

,  2 

-tt;; 

,  2 

> 

1 

,  1.519  •  • 

•  >  II 

,  1.519  •  •  • 

) 

^2 

j  <0.85 

,  0. 36 

>  jj  <0.85 

,  0.36 

> 

"3 

j  <1.1125 

,  0.21 

>  ;!  <  1.000338  ■  ■  • 
l| 

,  0.032801  •  • 

-  > 

^4 

j  <0.9320  •  •  ■ 

■  ,  0.144  3  •  • 

•  >  ||  <0.99999979  •  •  ■ 

.  ^  .li _ _ 

■  ,  0.00000740  • 

•  •  > 

^'igure  4.  Example  7.1  The  four  discs 


Nz^ 


Nz, 


is  shaded.  Since 


Nz, 


0'  "“O'  “1'  *'“1 
=  0  there  is  no  zero 


The  set  Z^  n  Nz 
0  0 


z  .  z 
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